Representation, Approximation and Learning of Submodular Functions Using Low-rank Decision Trees
نویسندگان
چکیده
We study the complexity of approximate representation and learning of submodular functions over the uniform distribution on the Boolean hypercube {0, 1}. Our main result is the following structural theorem: any submodular function is -close in `2 to a real-valued decision tree (DT) of depth O(1/ ). This immediately implies that any submodular function is -close to a function of at most 2 ) variables and has a spectral `1 norm of 2 2). It also implies the closest previous result that states that submodular functions can be approximated by polynomials of degreeO(1/ ) (Cheraghchi et al., 2012). Our result is proved by constructing an approximation of a submodular function by a DT of rank 4/ 2 and a proof that any rank-r DT can be -approximated by a DT of depth 5 2 (r + log(1/ )). We show that these structural results can be exploited to give an attribute-efficient PAC learning algorithm for submodular functions running in time Õ(n) · 2 4). The best previous algorithm for the problem requires n ) time and examples (Cheraghchi et al., 2012) but works also in the agnostic setting. In addition, we give improved learning algorithms for a number of related settings. We also prove that our PAC and agnostic learning algorithms are essentially optimal via two lower bounds: (1) an information-theoretic lower bound of 2 ) on the complexity of learning monotone submodular functions in any reasonable model (including learning with value queries); (2) computational lower bound of n ) based on a reduction to learning of sparse parities with noise, widely-believed to be intractable. These are the first lower bounds for learning of submodular functions over the uniform distribution.
منابع مشابه
Some Results about the Contractions and the Pendant Pairs of a Submodular System
Submodularity is an important property of set functions with deep theoretical results and various applications. Submodular systems appear in many applicable area, for example machine learning, economics, computer vision, social science, game theory and combinatorial optimization. Nowadays submodular functions optimization has been attracted by many researchers. Pendant pairs of a symmetric...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملNear Optimal algorithms for constrained submodular programs with discounted cooperative costs
In this paper, we investigate two problems, namely (a) minimizing a submodular cost function under combinatorial constraints, which include cuts, matchings, paths, etc., and (b) optimizing a submodular function under submodular cover and submodular knapsack constraints. While both problems have hardness factors of Ω( √ n) for general submodular cost functions, we show how we can achieve constan...
متن کاملCOMS 6253 : Advanced
Previously: • Administrative basics, introduction and high-level overview • Concept classes and the relationships among them: DNF formulas, decision trees, decision lists, linear and polynomial threshold functions. • The Probably Approximately Correct (PAC) learning model. • PAC learning linear threshold functions in poly(n, 1/ , log 1/δ) time • PAC learning polynomial threshold functions. Toda...
متن کاملSubmodular Optimization for Efficient Semi-supervised Support Vector Machines
In this work we present a quadratic programming approximation of the Semi-Supervised Support Vector Machine (S3VM) problem, namely approximate QP-S3VM, that can be efficiently solved using off the shelf optimization packages. We prove that this approximate formulation establishes a relation between the low density separation and the graph-based models of semi-supervised learning (SSL) which is ...
متن کامل